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GARCH is one of the most prominent nonlinear time series models, both widely applied and 
Li^ . thoroughly studied. Recently, it has been shown that the COGARCH model (which was intro- 

ry^ ' duced a few years ago by Kliippelberg, Lindner and Mailer) and Nelson's diffusion limit are 

the only functional continuous-time limits of GARCH in distribution. In contrast to Nelson's 
diffusion limit, COGARCH reproduces most of the stylized facts of financial time series. Since 
C^ ' it has been proven that Nelson's diffusion is not asymptotically equivalent to GARCH in defi- 

ciency, in the present paper, we investigate the relation between GARCH and COGARCH in 
Le Cam's framework of statistical equivalence. We show that GARCH converges generically to 
COGARCH, even in deficiency, provided that the volatility processes are observed. Hence, from 
a theoretical point of view, COGARCH can indeed be considered as a continuous-time equiva- 
^ ' lent to GARCH. Otherwise, when the observations are incomplete, GARCH still has a limiting 

experiment, which we call MCOGARCH, which is not equivalent, but nevertheless quite similar, 
to COGARCH. In the COGARCH model, the jump times can be more random than for the 
MCOGARCH, a fact practitioners may see as an advantage of COGARCH. 
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1. Introduction 

^^ I Since the seminal papers of Engle [10] and Bollerslev [4] the discrete-time GARCH 

H ' methodology has become a widely applied tool in the modeling of heteroscedasticity 

- - -' in financial times series. On the other hand, continuous-time models are very useful, for 

instance, in option pricing, as shovifn by Black and Scholes [3] and Merton [21], in the 
analysis of tick-by-tick data and for modeling irregularly spaced time series. 

In the 1990s, researchers tried to bridge the gap between continuous and discrete 
time. Nelson [23] showed that an appropriately parametrized GARCH can be seen as 
a discrete-time approximation of a bivariate diffusion model on an approximating time 
grid. However, this diffusion model docs not capture most of the so-called stylized facts 
reflecting empirical findings in financial time series: for example, volatility exhibits heavy 
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tails, jumps upward and clusters on high levels. To overcome the shortcomings of the 
diffusion model, Khippelberg et al. [17] introduced a new continuous-time GARCH model 
which they called COGARCH. In contrast to the bivariate diffusion, this model exhibits 
many of the stylized facts. We refer the reader to Fasen et al. [12] for an extensive 
discussion of the stylized facts and the various competing volatility models proposed in 
the literature. 

Recently, Kallsen and Vescnmaycr [16] and Mailer et al. [20] have identified COGA- 
RCH as a functional limit of GARCH in distribution. Most notably, Kallsen and Vcsen- 
mayer [16] have argued that Nelson's diffusion and COGARCH are the only possible 
limits of GARCH in distribution in a scmimartingalc setting. 

The passage from discrete to continuous time has an obviously appealing practical 
purpose: we can estimate the underlying continuous-time model parameters by a time 
series formulation and plug them into the continuous-time limit for other purposes. As 
argued by Wang [28], such a passage is, in general, only justified if the corresponding 
statistical experiments converge in Le Cam's framework of deficiency (see Le Cam [18], 
Le Cam and Young [19] and Strasser [27]). 

In particular, Wang [28] (see also Brown et al. [6]) showed, assuming independent 
Gaussian innovations, that Nelson's diffusion approximation of GARCH is not valid in 
deficiency: the innovations encounter the two models in an intrinsically different way. 
Whereas GARCH is driven by onc-dimcnsional innovations, its diffusion limit is driven 
by planar Brownian motion. 

In contrast to Nelson's approximation, COGARCH is driven by a Levy process of 
only one dimension, thereby mimicking one of the key features of GARCH. Naturally, 
the following questions arise: Are the approximations of COGARCH by GARCH, as 
proposed by Kallsen and Vesenmayer [16] and Mailer et al. [20], also valid in deficiency? 
Does the limiting model depend on the underlying sampling scheme? 

Dealing with Le Cam's distance in deficiency is a challenging task. In particular, asymp- 
totic equivalence results for dependent data arc very scarce; see Dalalyan and Rcifi [8] 
for an overview. Further obstacles arise from the intrinsic heteroscedasticity of GARCH. 
Therefore, in this paper, we restrict ourselves to compound Poisson processes as driving 
Levy process and assume that the innovations are randomly thinned. This approximation 
scheme also occurs in the papers [16] and [20]. Random thinning is a standard limiting 
procedure in many other areas of probability theory and statistics. In particular, we men- 
tion the peak-over-threshold method in extreme value theory (see Remark 2.1(ii)). In 
contrast to our approximation scheme, most papers on statistical equivalence deal with 
aggregated innovations where the experiments are compared to Gaussian shift experi- 
ments (see Brown and Low [5], Nussbaum [24], Grama and Neumann [13] and Carter [7] 
and references therein; see also Milstein and Nussbaum [22] with potential applications 
to time series analysis). We point out once again that for the GARCH, aggregated inno- 
vations lead to the diffusion limit investigated by Nelson [23] and Wang [28]. 

The paper is organized as follows. Section 2 contains our main results. We introduce the 
experiments and sampling schemes in Section 2.1. In Section 2.2. wc construct a limiting 
experiment for randomly thinned GARCH with unobserved conditional variances. As 
shown in Section 2.3, using both theoretical and numerical methods, this experiment 
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is generically not equivalent to COGARCH. However, if the conditional variances are 
observable in full, then all experiments are generically (asymptotically) equivalent to 
COGARCH - this is shown in Section 2.4. We conclude in Section 3. In Sections 4-7 
we give the proofs of all theorems and propositions from Section 2. Section 4 contains 
the proof of Theorem 2.1, Section 5 the proof of Theorem 2.2 and Section 6 the proof of 
Theorem 2.3. The proofs of all propositions in Section 2.4 are reported in Section 7. In 
the Appendix, we review some of the basic notions of Lc Cam's convergence in deficiency. 

2. Main results 

2.1. GARCH-type experiments in discrete and continuous time 

For all n G N we consider an rt-dimensional vector Z„ = {Zn^k)i<k<n with distribution 

C{Zr,)^{{l-pn)eo+PnQnf", (2.1) 

where, for all n eN, p„ G (0, 1) and Qn is a probability measure on the Borcl field S(M). 
Here, eg denotes the Dirac measure with total mass in zero. 

The parameter p„ modulates our random thinning. In accordance with the law of rare 
events, we assume that the following limit exists in (0,oo): 

7= lim np„ e (0,oo). (2.2) 

In the sequel, we will encounter several GARCH-type processes, all indexed by 6* € 
[0,00)''. In discrete time, processes will be further indexed by n G N and a suitable 
parametrization. Throughout this paper, a parametrization is a pair {&^(H„)neM)y 
where 6 is a non-empty subset of [0,oo)'', and for all n G N, iJ„ is a mapping 
Hn = (^o,Ti, /3ri7 ckri, Ki) : 6 — > [0, oo)"'. Here, hg (/io,n(^)) denotes the unknown initial value 
of the volatility Hq, which is treated as an additional unknown parameter in this paper. 
For the corresponding continuous-time limits, a is the mean reversion parameter of the 
volatility processes and /3/a the asymptotically stable fixed point of the (unperturbated) 
volatility SDE (ODE). A is a scaling parameter for the corresponding jumps of the volatil- 
ity processes. 

For a parametrization (O, (iJ„)„gN), we consider the sequence of partial sums corre- 
sponding to a randomly thinned GARCH model indexed by G O and n G N, defined 

by 

G„(fc) = G„(fc - 1) + hl/\k - l)Z„.fc, G„(0) = 0, 

hn{k) = Pn{e)+a.n{e)h„{k - 1) + A„(0)/i„(fc - 1)^^ ,., (2.3) 

/^„(0) = VnW, l<k<n, 9ee, 

where Hn{9) = {ho^n{0) , Pn{S) , ctn{S) , A„(0)) for all 6 £ Q. Note that this specification of 
a GARCH does not quite follow the traditional one, but enumerating the indices generates 
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the same processes. Also, observe that the definition of {Gn,hn) in (2.3) depends on the 
choice of (6, (7J„)„gN)- 

Provided that Q„ converges weakly to some probability measure Q, the limit in (2.2) 
sets up convergence in distribution of X^tLi ^n.k to a compound Poisson process with 
rate 7 and jump distribution Q as n — >■ 00. For a choice of (0, (i?„)„gN) it is thus natural 
to ask whether the limit of (G„([nt]),/i„([ni]))o<t<i in distribution exists along Hn{9) 
as ri -^ 00 for fixed 6 E Q. In [16] and [20], such paramctrizations have been success- 
fully constructed. Moreover, the corresponding continuous-time limit equals COGARCH 
driven by a compound Poisson process. 

COGARCH is a process {G,h) = (G'(t), /i(i))o<t<i that is indexed by 6* = {hQ,l3,a,X) € 
[0, 00)^ and determined as the unique pathwise solution of the following system of integral 
equations: 

G(i)= / h^/^{s-)zN{ds,dz), 

J\0,t]xS. 

(2.4) 
h{t) = ho+ f3 ~ ah{s-) ds + X h{s-)z^N{ds,dz), 

J[0,t] J[0,t]xK 

where A^ is a Poisson point measure on [0, 1] x M with intensity 7^® Q. 

In the sequel, we restrict our analysis to the two following sampling schemes: 

• Incomplete observations: only G and G„ {n G N) arc observable in full, whereas the 
corresponding volatility processes, h and h„ (n € N), are unobservable. 

• Complete observations: both processes {G,h) and {Gn-,h„) are observable in full. 

We will deal with these two sampling schemes in Sections 2.2-2.3 and Section 2.4, respec- 
tively. Not surprisingly, a simpler theory is in place in the case of complete observations. 
In a more realistic scenario, where observations of the volatility processes are not avail- 
able, results are more difficult due to the nonlinearity of (CO)GARCH. 

Throughout the paper, the space of right-continuous functions g : [0, 1] — > Mf^ with 
left limits on [0, 1] is denoted by Dd- We endow Dd with the cr-algebra Vd, generated 
by point evaluations (see Billingsley [2]). Furthermore, let M.d be the space of all non- 
negative point measures on [0, 1] x W^ with finite support. We equip this space with the 
cr-algebra Md generated by the point evaluations A h^ fJ-{A), A e B{[0, 1] x M''), ^ g Mq 
(see Reiss [26], pages 5-6). 

The trace of the Borel field in M = (RU {—00, 00})'' with respect to A C R is denoted 
by B{A). The Lebesgue measure on B{M.) and the Dirac measure with total mass in x 
are denoted by £ and e^, respectively. If (E,A) is a measurable space and X is a random 
element taking values in {E,A), then its distribution is denoted by C{X). Whenever 
this distribution depends on a parameter 9, we employ the notation Ce{X). If (Ei,Ai), 
i= 1,2, are measurable spaces and X : Ei ^ E2 is A1/A2 measurable, then ^-^ denotes 
the image of a measure fj, under X . 

We refer to the Appendix and [27] for unexplained notation relating to convergence in 
deficiency. 
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2.2. Limit experiments of GARCH (incomplete observations) 

In this subsection, we assume that the volatihty processes are unobservable. To pursue 
our program, we introduce another class of processes. Let {G,h) = {G{t) , h{t))o<t<i be 
the unique pathwise solution of the following system of integral equations: 

G{t)= f h^/'^{s~-)zN{ds,dz), 

J[0,t]xM 

(2.5) 
h{t)^ho+ f l3~ah{s^)dTN{s) + X [ h{s-)z^N{ds,dz), 

J[0,t] J[0,t]xS. 

where 9 = (/io,/3,a. A) G [0,00)^^. Here, TiMi -^ -Di, ai-^T^r is defined as follows: If, for 
some 771 G N, = to < ii < • • • < tm < 1 and xi, . . ., Xm £ K, <t G M admits a representation 
of the form a = X^kLi ^{tk,xk)i where = to < ti < • • • < t,„ < 1, then we set 



TAt) = / *'' - + -, te[tk-i,tk),l<k<m, 

m[tk-tk-ij m 

T,{t) = —^—^ T + 1' t^[tn^^]■ 



(2.6) 



If such a representation does not exist, then we set T'cr(t) = t for all t G [0, 1]. 

We call (G, h) the MCOGARCH, an acronym referring to modified COGARCH. To 
illustrate the difference between COGARCH and MCOGARCH, we next consider a sim- 
pler representation of G (we will return to (2.5) in our analysis in Section 2.4). 

To this end, let ly — (i^(t))o<t<i be a Poisson process with rate 7 and (Zk)k<£N be 
a sequence of independent random variables, independent of ly. By solving the integral 
equations for h in (2.5), we observe that 

Ce{G)^Ce(Y.K{lk.,zX (2.7) 

where, for k,m eN, k> 2, we set 



hm.k.9 



-(1 - e-"/") + e-"/'"/i„,,_i,e[l + AZ|_i] 



(2.8) 



a 

/im.i.e — — (,t — e ' j + e ' /lo 
a 

for 9 — (ho, [3, a, A) G [0, oo)'*, a > 0, with the convention that ^g, = 0. Here, we extend 
the definition of hm^kfi to 9 = (/lo, /?, 0, A) G [0, 00)^ by taking a 1 in (2.8). 

In view of (2.8), note that the magnitudes of the jumps of G (in space) depend on their 
multiplicities and the sizes of innovations, but not on their arrival times. This attribute 
is not shared by COGARCH. Therefore, it is, to some extent, justified to speak of G 
and G as experiments driven by two and three sources of randomness, respectively: the 
number of jumps, the innovations and the arrival times. 
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As no information about the volatility processes is assumed in this subsection, we 
consider the following experiment of MCOGARCH type: 

£^iDi,Vu{Cg{G))g^[0.o.r). (2.9) 

For a parametrization (G, {Hn)n<£N)-, we consider the corresponding GARCH experiments 
in discrete time, 

f„,H„(e) = (R",6(M"),(/:9(G„))e6e), neN, (2.10) 

where, for n S N, G„ = (G„(fc))i<fe<„ is defined by (2.3) via the parametrization 
(9, (7J„)„gN)- We write Sn,H,^ =^n,H„{&), provided that we have Q = [0,00)"* in (2.10). 
Next, we give a GARCH parametrization such that the randomly thinned GARCH 
converges strongly to the MCOGARCH experiment £ in deficiency; therefore, we pick 
6 = {ho,l3, a. A) S 6 and n e N. If a > 0, then we set 



(2.11) 



h^°l{0) = hoe-"/- + ^{1- e-"/"), I3i'\e) = ^(1 - e-"/"), 
a a 

al^^ (9) = e""/", Ai°) (9) = Ac""/" 
and, otherwise, if a = 0, then we set 

CW-/^o + §, (^i^HO)-^^, ai^HO) = l, A(")(0) = A. (2.12) 



Let {[0,ooY, (Hn )) be the corresponding parametrization and G„ the corresponding 
partial sum processes of GARCH in (2.3). 

Although the parametrization in (2.11) and (2.12) is quite elaborate, we show that the 
corresponding GARCH experiment converges to the experiment of MCOGARCH-type 
with no restrictions on the limiting probability measure Q assumed (see Section 4 for 
a proof). 

Theorem 2.1. Let (2.2) be satisfied for some 7 € (0,oo) and p„ € (0, 1), 71 G N. // Qn 
tends to a probability measure Q in total variation as n — )■ c», then £ „(o) converges 

strongly to £ in deficiency as n — )■ 00 . 

If Q is absolutely continuous with respect to the Lebesgue measure, then Theorem 2.1 
partially extends to other GARCH parametrizations (see Section 5 for a proof of the 
following theorem). 

Theorem 2.2. Let (2.2) he satisfied for some 7 G (0, 00) and p„ G (0, 1), n G N. Suppose 
both that Qn tends to a probability measure Q in total variation as n —> 00 and that 
Q<£. _ 

Let 7^ with compact closure O in (0, 00) x [0, 00)'^. For n G N, let Hn ~ (/^o,n, PmOtn, 
A„) : O — > [0, 00)^ be a GARCH parametrization and G„ the corresponding GARCH model 
in (2.3). 
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// there exist tiq G N and C > such that, for all n>nQ, both 

sup inax{|Vn(^)-/io|,|A„(0)-A|}<- (2.13) 

9=iho,p,a,\)ee n 



and 



then 



sup niax{|n/3„(6l)-^|,|n(a„(6')-l)+a|}<C, (2.14) 

= {ho,f3.a,\)ee 



lim sup||£e(G„)-/:e(GW)|l=0 (2.15) 

and £n,H„{^) converges strongly to £{0) in deficiency as n ^ c». 

Remark 2.1. (i) Let Q = Qn for all n g N. In Kallsen and Vesenmayer [16] and Mailer 
et al. [20], the GARCH parametrizations (0, (i?„ )TieN) and (0, (iJ„ )«£») have been 
considered where, for = {ho, 13, a. A) S (0, oo)^ x [0, oo), using the obvious notation, 8 = 
(0,oo)^ X [0,oo) and 

'"' (2 16~) 

ar^(0)=a(r)(e)=e-"/", 

A(fv)(0) = A, Ai^)(^) = e-"/"A. 

Kallsen and Vesenmayer [16] have shown that (G„[7i-], ft,„[n-]), as defined in (2.3) by 
Hn{&) = Hn (9), converge to COGARCH with parameter d in (2.4) in law, with respect 
to the Skorokhod topology, as n — >■ oo, for all G 9. 

Mailer et al. [20] have encountered a slightly different scenario. For 6 £ Q, they have 
embedded a sequence of GARCH models into a given COGARCH and obtained the con- 
vergence with respect to the same topology, now driven by a general Levy process, even in 
probability. If the driving process is a compound Poisson process with rate 7 and jump 
size distribution Q, then their analysis relates to a situation where the corresponding 
partial sums have the same law as (G„[n-],/i„[n-]) under the parametrization 7J„ (9), 
eG&,neN. 

In short, it follows from the analyses in [16] that, for fixed 6' € 0, the partial sum 
processes of GARCH with parametrization Hn ' (9) converge to COGARCH in law, as 
n— ^ 00, with a similar result being true for the parametrization Hn (9) in [20]. On the 
other hand, both parametrizations fall into the framework of Theorem 2.2. Hence, if the 
distribution of the innovations admits a Lebesgue density, then the limiting experiment 
is given by MCOGARCH £{&) rather than COGARCH £{0). 
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(ii) In part (i), Q = Qn does not depend on n. Potential applications where Qn de- 
pends on n arise in the peak-over-threshold method in extreme value theory; see, for 
instance, Embrechts et al. [9], Resnick [25] and Falk et al. [11]. Here, Qn equals the laws 
of rescaled innovations, conditioned on the event that they exceed a given threshold. Un- 
der reasonable assumptions, Qn converge weakly to a generalized Parcto distribution Q 
as n — > oo. Also, the corresponding GARCH models converge in distribution in law to 
a COGARCH driven by a compound Poisson process with jump distribution Q. In this 
sense, COGARCH serves as a good approximation of GARCH in law if we arc interested 
in the extreme parts of the innovations. On the other hand, if Qn converges to Q, even in 
total variation norm, then it follows from Theorem 2.1 that the corresponding limiting 
experiment must be statistically equivalent to MCOGARCH. 

2.3. COGARCH vs. MCOGARCH (incomplete observations) 

In this subsection, we investigate whether the experiments induced by COGARCH and 
MCOGARCH are of the same type. Here, we again assume that the volatility processes 
are unobservable. Therefore, we recall (2.4) and consider the experiment 

f = (Z?i,2?i,(£e(G))ee[o.oo)4). (2.17) 

Note that both experiments £ and £ depend on the intensity measure -fi (E) Q which 
enters (2.4) and (2.5) via N. In this subsection, we include this dependence in our notation 
by writing £^^q and £j^q instead of £ and £, respectively. 

Let / : M — >■ (0, oo] be a strictly positive probability density with respect to Lebesgue 
measure and set 

^C{h) := h^ f f{hzff{zf-i dz, /i > 0, C e (0, 1). (2.18) 



JS. 

By Holder's inequality, gj^^ defines a function gj^^ : (0, oo) -^ (0, 1] with g/,^(l) = 1. Note 
that gf^c, satisfies both a scaling and a reflection property: for all < C < 1, a, /i > 0, 

5a/(a.).cW=ffAcW: 5/,C W = 3/.1-C (V/^) • (2-19) 

Next, we investigate how COGARCH relates to MCOGARCH in deficiency (see Section 6 
for a proof). 

Theorem 2.3. Let 7^ O C (0,oo) x [0,00)"^. Assume that Q admits a strictly positive 
Lebesgue density f such that for some (^q G (0, 1) , gfXo ■ (0; °^) ^ Pi 1] *•' strictly increas- 
ing on (0, 1]. 

Let (7,i)„gN Q (0, 00) be a sequence such that 7 = lim„_j.oo 7ri exists in [0, 00) and 7„ 7^ 7 
for all n e N. _ 

If £'y„,Q{Q) is equivalent to £~i^.q{Q) for all 71 e N, then we have: 

(i) if (/io,i, /3, a. A), (/io,2, /3, a. A) G 6 and /3 > 0, then Hq^i — /io,2; 
(u) i/(/io,/3i,a,A),(/io,/32,a, A) e e, then(3i=(i2; 
(iii) ?/ (/io,/?,Q!i, A), (/io,/3,a2, A) € O andj3 = Q, then ai =a2; 
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(iv) if (fto, /?,«!, A), (/iO)/3)Ct2, A) S Q and a.\ = 0, then a2 ~ 0; 
(v) «/ (/io,/3,ai, A), (/io,/3,a2, A) G andai<a2, then hg > jS / ai . 

Theorem 2.3 indicates that equivalence of MCOGARCH and COGARCH is restricted 
to parameter sets that are of considerably lower dimension and which have non-empty 
interiors. Hence, we do not have equivalence in deficiency. 

Observe that ( i—^ gfx{f^) occurs as the Hellinger transformation of the scaling experi- 
ment {M.,B{M.),{£{Z),C{Z/h)}), where Z is a random variable with Lebesgue density /. 
Next, we verify the monotonicity property of (?/,^(ft-) in a number of examples. 

Generalized symmetric gamma distribution. Let a, 5, c > and F be Euler's gamma 
function. Assume that / : R — >■ (0, oo] has the following form: 

This class of distributions covers important special cases such as the normal distribution 
with zero mean and the Laplace distribution. It follows straightforwardly that 



h"'^ 



c/b 



^f^<^'^= Wc + ii-C) ) ' o<C<i,/^>o. 

Observe that gf^^ : (0, oo) — > [0, 1] is strictly increasing on (0, 1] for all < C < 1 and thus / 
satisfies the monotonicity assumption of Theorem 2.3. 

Centered Cauchy distribution. Let a > and fa{z) = — ttt'-^ be the density of the 
centered Cauchy distribution Cauchy(0, a) with scaling parameter a. By the scaling prop- 
erty in (2.19), we have 3/^,i/2(^) = gfi,il2{h) for all h>Q. By differentiating this under 
the integral sign, we obtain 

d^5/„,i/2W = ^^y^ (1 + (l + ;,2),/;, + ,2)3/2 >0- a>0,0<h<l. 

Consequently, the centered Cauchy distribution satisfies the monotonicity assumption of 
Theorem 2.3. 

Next, we present a simulation-based approach to assess non-equivalence. This approach 
can be used in cases not covered by Theorem 2.3 (or when it is not clear whether the 
assumption of Theorem 2.3 is satisfied). Recall that statistical equivalence of the experi- 
ments £ and £ is implied (see [27], Theorem 53.10) when, for all finite subsets 9 C [0, oo)'' 
and all 6q ^ Q, we have 

-((^).J-4(^).J- 

We generated samples from these two distributions according to the recursion (6.1) in the 
proof of Theorem 2.3 in Section 6. To this end, we first restricted the parameter space 
to a set with two elements, 6o and 6. While fixing 6*0 to (2,1,1,0.1), we have chosen 
eight vectors 9ij, i = 1, ... ,4:, j = 1,2, for the parameter vector 9, which differ from 9q in 
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Table 1. Choices of do and 6 = 9ij in equation (2.20) 



^0 


2 


1 


1 


0.1 


^11 


0.4 


1 


1 


0.1 


9l2 


10 


1 


1 


0.1 


021 


2 


0.2 


1 


0.1 


6»22 


2 


5 


1 


0.1 


031 


2 


1 


0.2 


0.1 


6*32 


2 


1 


5 


0.1 


6»41 


2 


1 


1 


0.02 


042 


2 


1 


1 


0.5 



only one component; see Tabic 1. Second, we checked the distributional equality (2.20) for 
three different jump distributions: the standard normal, the standard Cauchy distribution 
Cauchy(0, 1) (for comparison, note that both of these are covered by Theorem 2.3) and 
the normal mixture distribution 

iiV(-0.5, 0.75) + iiV(0.5, 0.75), 

which has mean and variance 1. The intensity 7 was always fixed to 4. 

For each of the eight pairs (0Oj ^y ) and each of the three jump distributions, we gener- 
ated 10'^ samples of the two distributions referring to the COGARCH and MCOGARCH 
in equation (2.20). The left-hand column of Table 2 reports the choice of 9ij, whereas the 
other nine columns report, for each of the three jump distributions, the 25% quantile, 
median and 75% quantile of the distribution in equation (2.20). 

Next, we applied the Wilcoxon rank sum test (also known as Mann-Whitncy test) 
to investigate the null hypothesis the median of the likelihood ratio for the COGARCH 
experiment equals the median of the likelihood ratio for the MCOGARCH experiment. 
Table 3 reports the values of the Wilcoxon test statistic W, together with the corre- 
sponding p-values. For each jump distribution, the first column corresponds to a sample 
size of 10^, the second to 10^ and the third to a sample size of 10^ per experiment. 
Obviously, the p-values tend to as the sample size increases. Based on 10^ samples, 
the null hypothesis is most significantly rejected, for all three jump distributions and all 
eight parameter vectors 9ij. In other words, there is strong evidence that in the case of 
incomplete observations, the randomly thinned GARCH and COGARCH experiments 
are not statistically equivalent for these jump distributions. This confirms our conjec- 
ture, that Theorem 2.3 holds in a much more general formulation for quite arbitrary 
jump distributions. 

2.4. Complete observations 

In the previous subsections, we investigated both convergence and equivalence in defi- 
ciency of a variety of GARCH- type experiments under the assumption that their volatility 
processes /i„, h and h are unobservable. In this subsection, we deal with the situation 
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Table 2. Estimated 25% quantiles, medians and 75% quantiles for tiie distributions in (2.20) 





COGARCH 


















MCOGARCH 
















Jumps 


iV(0,l) 
25% 


Median 


75% 


Caucliy(0, 1) 
25% Median 


75% 


Mixed iV 




Quantiles 


25% 


Median 


75% 


Sn 


0.1081 


0.5560 


1.3888 


0.5521 


0.7775 


1.1767 


0.0909 


0.5329 


1.3918 




0.1785 


0.6977 


1.3495 


0.5884 


0.8226 


1.1811 


0.1558 


0.6743 


1.3543 


0l2 


0.1505 


0.3152 


0.6449 


0.4173 


0.8127 


1.4573 


0.1436 


0.3008 


0.6136 




0.1637 


0.3377 


0.6768 


0.4412 


0.8335 


1.4505 


0.1575 


0.3264 


0.6559 


6*21 


0.8326 


1.0168 


1.1711 


0.9273 


0.9761 


1.0393 


0.8307 


1.0201 


1.1766 




0.7605 


1.0114 


1.2459 


0.9051 


0.9566 


1.0539 


0.7560 


1.0155 


1.2512 


6*22 


0.4883 


0.7071 


1.0086 


0.7765 


1.0229 


1.2130 


0.4797 


0.6956 


1.0000 




0.4201 


0.6077 


1.0000 


0.7010 


1.0247 


1.2676 


0.4100 


0.5988 


0.9798 


6*31 


0.6928 


0.8543 


1.0621 


0.8497 


1.0000 


1.1506 


0.6863 


0.8476 


1.0530 




0.6304 


0.7841 


1.0629 


0.8029 


1.0000 


1.1881 


0.6248 


0.7757 


1.0524 


6*32 


0.0053 


0.1702 


1.1056 


0.3853 


0.6449 


1.1172 


0.0028 


0.1392 


1.0856 




0.0010 


0.0590 


0.9129 


0.3093 


0.5650 


1.1090 


0.0005 


0.0437 


0.8703 


^41 


0.9864 


1.0104 


1.0735 


0.8265 


1.0000 


1.0798 


0.9863 


1.0114 


1.0762 




0.9884 


1.0100 


1.0693 


0.8357 


1.0000 


1.0779 


0.9884 


1.0109 


1.0722 


O42 


0.6851 


0.8870 


1.0000 


0.6217 


0.9328 


1.0418 


0.6750 


0.8802 


1.0000 




0.6963 


0.8942 


1.0000 


0.6281 


0.9360 


1.0388 


0.6865 


0.8874 


1.0000 



where the corresponding volatihty processes are observable in fuU. Of course, this sit- 
uation is mainly of theoretical interest and will primarily help us to learn about the 
structural connections between GARCH and COGARCH. However, we want to briefly 
mention some modern approaches by which the unobservability of the volatility process 
can be dealt with in practice. For example, there are several modern ways to estimate the 
local volatility directly; see, for example, A'it-Sahalia, Mykland and Zhang [1] and refer- 
ences therein or Jacod, Kliippelberg and Miillcr [15], who use local volatility estimates 
also in a COGARCH context, and many others. The paper by Hubalek and Posedel [14] 
contains another very interesting idea. They use martingale estimating functions to es- 
timate the parameters in the Barndorff-Nielsen-Shephard model, which is composed of 
a stochastic differential equation (SDE) for the log prices and another SDE for the vari- 
ance. However, the martingale estimating functions approach requires that both processes 
can be observed. Hence, Hubalek and Posedel [14] reinterpret the volatility equation as 
an equation for some other observable measure of trading intensity (such as trading vol- 
ume or the number of trades), assuming that the instantaneous variance process behaves 
(up to a time- independent constant) exactly as the observable trading volume (or the 
number of trades). As they show in their real-data example, this approach leads to quite 
satisfying results. The same idea could be used, of course, for the COGARCH model, to 
bypass problems with the unobservability of the volatility process in practice. 
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Table 3. Wilcoxon rank sum test: Values of Wilcoxon test statistic W and corresonding p- values 





W statistic 


















p- value 


















Jumps 


N{Q,1) 






Caucliy(0 


,1) 




Mixed N 






Sample size 


10* 


10^ 


10"^ 


W^ 


lo-"- 


10^ 


10* 


10^ 


10" 


sii 


-7.10 


-24.12 


-73.91 


-8.82 


-25.46 


-73.81 


-8.11 


-24.01 


-71.91 




0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


912 


-3.04 


-14.90 


-47.21 


-0.35 


-2.73 


-9.90 


-6.04 


-15.37 


-48.40 




0.0024 


0.0000 


0.0000 


0.7245 


0.0064 


0.0000 


0.0000 


0.0000 


0.0000 


021 


-1.56 


-3.20 


-12.52 


8.71 


31.90 


98.90 


-0.45 


-3.45 


-14.05 




0.1189 


0.0014 


0.0000 


0.0000 


0.0000 


0.0000 


0.6545 


0.0006 


0.0000 


922 


12.10 


44.13 


136.09 


-1.92 


-2.69 


-8.28 


14.17 


45.16 


141.15 




0.0000 


0.0000 


0.0000 


0.0546 


0.0070 


0.0000 


0.0000 


0.0000 


0.0000 


e-si 


12.38 


37.96 


116.09 


1.76 


2.16 


10.30 


12.07 


38.89 


119.95 




0.0000 


0.0000 


0.0000 


0.0788 


0.0311 


0.0000 


0.0000 


0.0000 


0.0000 


032 


11.34 


39.48 


126.96 


11.63 


33.04 


100.34 


13.59 


42.66 


131.75 




0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


Oil 


0.83 


1.41 


5.79 


-1.52 


-2.85 


-4.98 


2.83 


3.64 


5.35 




0.4054 


0.1572 


0.0000 


0.1280 


0.0044 


0.0000 


0.0047 


0.0003 


0.0000 


Oi2 


-2.96 


-3.71 


-13.73 


-1.29 


-2.35 


-2.94 


-1.63 


-4.81 


-15.26 




0.0031 


0.0002 


0.0000 


0.1963 


0.0189 


0.0032 


0.1041 


0.0000 


0.0000 



Returning to theoretical matters, we now consider the following GARCH-type experi- 
ments in continuous time with fully observed volatilities, denoted by 

£h = (-D2,2?2, ('C0(G,/i))0g[o,oo)4), £h = {02,1^2, ('Ce(G,/i))eg[o^oo)-*), 

where h is defined by the specification in (2.5) and (2.6). Similarly to Sections 2.1 and 2.2, 
where we dealt with continuous time, both experiments £h and £h depend on Q and 7 > 
as well. In this subsection, we will suppress this dependence in our notation. 

We need to specify a set Q^ ^ [0,00)"* of exceptional points in the parameter space 
[0,00)* as follows: 

^{9 = {ha,l3,a,X)£ [0,oo)'^ -.hoa- 



e. 



:/3}. 



(2.21) 



Observe that 6e is closely connected to the fixed point of the affine differential equation 
h'{t) = (3 — ah{t). Indeed, if 6* = {ho, /3, a, A) € 8e, then we have h{t) = h{t) = ho for all 
t e [0,T), where T is the first jump of (M)COGARCII. It is impossible to recover the 
parameters /?,a, A in full within the time horizon [0,r). Otherwise, if ho is not the fixed 
point of this differential equation, then it is always possible to recover parts of 9 by 
taking appropriate derivatives. In the next proposition, we formalize this idea and show 
that both £h and £h are equivalent to a simple reference experiment (see Section 7.1 for 
a proof). 
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Proposition 2.1. //Q({0}) = 0, then both £h and£h are equivalent to T = {[0,oo]'^,B{[0, 
oo]"*), {Qe)eelo,ooy) where, for 6 = (/io,/?,a, A) G [0,oo)^, 7 > 0, we set 




e"''£(/jo,oo,oo,oo) + (l-e~'^)£e, 6* € ee,/io > 0, A > 0, .^^^. 

''£(ho,oo,oo,cx,) +(1 -e~'^)£(/jo,oo,oo,0), 6* € Be, /lo > 0, A = 0, 

oo.oo.oo); " G '^ei /^O — 'J; 

and 0e is the set defined as in (2.21). 

Remark 2.2. In the situation of Proposition 2.1, we require Q to satisfy <5({0}) = 0. 
Indeed, liQ — e^^ then it is easy to see that both £h and £h are equivalent to J^, where we 
formally set 7 = in (2.22). Otherwise, if Q({0}) S [0, 1), then we may adjust the intensity 

measures of the driving Poisson measure accordingly, to see that both £h and £h are 
equivalent to F , but with 7 replaced by 7(5(K\{0}) in the definition of Qg. Analogously, 
one can adjust the discrete-time experiments that we consider in Proposition 2.2. We 
leave the details to the reader. 

Next, we investigate the discrete-time experiments. Note that the initial value of h is 
observable in continuous time. As a result, it is always possible to recover the parame- 
ter Hq in full. To account for this phenomenon in discrete time, we introduce the following 
sequence of experiments. £h,n.H„, indexed by ri G N, where 

f/^,n,//„ = ([K"+i]2,6([R"+i]2),(£e(G„,/in))ee[o,oo)^), neN. (2.23) 

Here, ([0, 00)'*, (Hn)) is a parametrization of the full parameter space [0, 00)"'; both G„ = 
(G„,fc)o<fc<n and/i„ = {hn,k)o<k<n aredefinedby (2.3) viai?„(6') = {ho^n{6),l3n{9),an{9), 
A„(6')) for n G N and 9 G [0,oo)^ (by a slight abuse of previous notation). We are now in 
a position to state a discrete-time analog of Proposition 2.1 (see Section 7.2 for a proof). 

Proposition 2.2. Suppose that (2.2) is satisfied for some 7 € (0,oo) and p„ e (Oil)? 
n G N. -Let ([0,00)*, iJ„)„gN be the parametrization in (2.11) and (2.12). Also, tet([0,oo)^, 
Hn )neN and ([0,oo)^,iJ„ )neN be the parametrizations in (2.16), respectively. 

If ©({O}) = Qn({0}) = for all n S N, then the following assertions hold as n ^ cx), 
both in deficiency: 

(i) £h.n,Hn converges strongly to T ; 
(ii) both £ zj(^v) and £ (m) are asymptotically equivalent to 

J'„ = ([0,oo]'*,S([0,oo]'^),(Qe^„)eg[o,oo)4), 

where, for n S N and 9 ~ (/io,/3,a. A) G [0,oo)^, we define Qe.n as Qg in (2.22), but 
with Qe replaced by 

&e,n = {0^ {h„, (3, a, A) e [0, <x)^ : hon{l - e""/") = /3}. 

Finally, we are concerned with the relationships between the experiments J- and J>i, 
n e N U {00} (see Section 7.3 for a proof). 
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Proposition 2.3. Let 7 > and 7^ C [0,oo)^. Let T ^Tn, n e N, he the experiments 
in Propositions 2.1 and 2.2, respectively. 

LetF= (]Q,oo\'^,B{\0,ooY'), ((9e)ee[o.oo)'') be the experiment where, for 9 = {ho,P,a,X) E 
[0,00)^, we define Qe as Qq in (2.22), but with Oe replaced by 

Ge ^ {Oy X (0, 00) X [0, 00) U [0, 00) X {0}2 X [0, 00). 
The following assertions then hold: 

(i) 6{T{e),T{e)) = S{T{Q),Tn{Q)) = for all n e N; 
(ii) (5(J"(e), J'(e)) = i/ and only if for all Hq > 0, 



{{13, a, A) e [0, cx3)3 : {Hq, /3, a, A) e 9 n Ge n ef } 7^ 

=> #{(/3,a)e[0,c5o)2:3A>0 (/io,/3,a, A) e Oe n 6} = 1; 



(2.24) 



(iii) liin„_).oo (5(J>i(0), J-'(0)) = i/ anrf on^j/ i/ t/iere exists some uq such that for all 
n>nQ and Hq > 0, (2.24) holds, but with Qg replaced by Qe.n- Ln particular, Tn converges 
weakly to T as n -^ 00 in deficiency. 

Let us rephrase our results in terms of the GARCH experiments, with the volatiUty 
processes fully observed in both continuous and discrete time. In contrast to the situation 
in Theorem 2.3, it follows from Proposition 2.1 that the continuous-time experiments 
induced by (M) COG ARCH are mutually equivalent in deficiency. Depending on the 
parametrization, (M)COGARCH also occurs as the limit in deficiency of discrete-time 
GARCH; in particular, this is the case for the parametrization in Proposition 2.2. In 
contrast to Theorem 2.3, for a large class of parameter sets 9, all of these discrete- 
time experiments, that is, £^^^(o)(9), £^^^(kv)(8), £'^^^(m)(0), are asymptotically 

equivalent to (M)COGARCH £h{Q) and £'/i(0), in deficiency, as n— >oo. This happens, 
for instance, if C [0,00)'' does not contain an open neighborhood of &e- Since the 
set Op is of lower dimension than [0,oo)^ it is thus justified to say that the randomly 
thinned GARCH is generically equivalent to COGARCH in deficiency as n — > cx). 

3. Conclusion 

In Le Cam's framework, Wang [28] and Brown et al. [6] investigated GARCH and Nel- 
son's diffusion limit. These authors dealt with aggregated Gaussian innovations. For 
a suitable parametrization. Mailer et al. [20] and Kallsen and Vesenmayer [16] showed 
that the GARCH model converges to the COGARCH model in probability and in dis- 
tribution, respectively, when the innovations are randomly thinned. These papers dealt 
with a general Levy process as the driving process of the COGARCH. In this paper, we 
have studied an important special case in Le Cam's framework of statistical experiments, 
namely, we have assumed that the driving process of COGARCH is a compound Poisson 
process. GARCH then converges generically to COGARCH, even in deficiency, provided 
that the volatility processes are observed. Hence, from a theoretical point of view, COG- 
ARCH can indeed be considered as a continuous-time equivalent to GARCH. Otherwise, 
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when the observations are incomplete, GARCH stih has a hmiting experiment, which we 
call MCOGARCH, but this will usually not be equivalent to COGARCH in deficiency. 
Nevertheless, this limiting experiment is, from a statistical point of view, quite similar 
to COGARCH since the only difference is the exact localization of the jump times. For 
COGARCH, the jump times can be more random than for MCOGARCH, but practi- 
tioners may see this as an additional advantage of COGARCH. 

It would be interesting to extend the analysis to more general Levy processes, rather 
than Brownian motion and compound Poisson processes. However, this would first require 
substantial investigations of the approximation and randomizations of Levy processes 
themselves and, therefore, seems out of reach at the present stage of research. 

4. Proof of Theorem 2.1 

For the reader's convenience, we first provide a brief roadmap for the proof of Theo- 
rem 2.1. The proof is split into two parts, which appear in Sections 4.1 and 4.3, respec- 
tively. The second part uses a lemma which we formulate and prove in Section 4.2. To 
prove that £^ ^(o) — >■ £ in deficiency, we will introduce intermediate experiments £j* „ 
and £2 n- The first of these two experiments corresponds to a deterministic time grid, the 
latter to a randomized time grid. First, we will show that £ „(o) is equivalent to £* ^ 
in deficiency and then, using Lemma 4.1 from Section 4.2, that fjn converges strongly 
to £. Finally, we will prove that £^ ,^ and £3 « ^^^ equivalent. 

4.1. Proof of Theorem 2.1 (part I) 

For n e N, define a point measure A^i.„ on [0, 1] x K by 

n 

^i." = Hl2-,fc7ioe(fc/n.z„,.): n€N. (4.1) 

fe=i 

Using A^i,,i, we pass from discrete to continuous time. For n S N, define 

•^Li = {Di,'Di, {C0{Gi^n))0e[o,ooy), 

where, for all < i < 1, n G N and 9 = (/io,/3,a. A) G [0,oo)^, {Gi^n,hi^n) is the unique 
pathwise solution of the following system of integral equations (t G [0,1]): 

hi^n{t) = ho+ f3 ~ ahi^n{s-) ds + X hi^n{s-)z'^Ni^n{ds,dz). 

J[0,i] J[0.t]xR 

Fix = (/io,/3,a. A) € [0,oo)'' with a ^ 0. By solving the linear ODE for /ii,„ in (4.2), 
observe that 



a ' \ n J 



(4.3) 
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for (fc - l)/n < i < k/n, l<k<n and n G N. It thus follows from (2.11) and (4.3) that 
for all 77, e N, 

/ii,„(l/7i-) = /loc-"/" + ^[1 - e-"/"] - /io,„(0), 

a 

f^hn('^A =l3n{O) + hiJ^—^-\an{O) + \n{0)Zlk_,], 2<k<n. 

In view of (2.3) and the identities in the last display, we thus have h„{k) — /ii.„(((fc + 
\)ln)-) for aU 7ie N, < fc < n- 1 and 9 = {ho,l3,a,X) G [0,cx))'' with a > 0. A similar 
argument is applicable to (2.12) and 9= (/io,/3,0,A) G [0,oo)^. It thus follows from (2.3) 
and (4.2) that 

/:9((Gi,„(fc/77))i<fe<„)=£e((G„(fc))i<fc<„), 77gN,0g[O,oo)4. 

Note that Gi,„ is constant on [{k — l)/n,k/n), 1 < k <n and tt, G N. Hence, £ „m is 
equivalent to £i ^ in deficiency for all 7i G N by (A. 2) and the monotonicity theorem for 
Markov kernels (see [26], Lemma 1.4.2(i)). 

Next, we randomize the deterministic time grid. Therefore, let {Uk)keN be an i.i.d. 
sequence of random variables independent of the vector Z„ , where Uk is uniformly dis- 
tributed on [0,1]. Set 

Vr,^k = {{k-l) + Uk)/n, l<k<n, (4.4) 

and define a point process A^2.n by 

n 

iV2,„ = ^lz„,,^oe(y„,,,z„,,), neN. (4.5) 

fc=i 

Let T be as in (2.6). For n G N, let £^„ == {Di,Vi, (/:e(G2,„))0e[o,oo)*): where for ah 
0<i<l,7iGN and 9 = {ho, 13, a, X) G [0,oo)^, (G2,n,'i2,n) is the pathwise unique solution 
of the following system of integral equations: 

G2,„(t)=/ hl/^is~)zN2.n{ds,dz), 

JfO.tlxR 

(4.6) 
h2,n{t) = ho+ /3-a/i2,n(s-)drAr2„(s) + A / /i2,,i(s-)z^7V2,„(ds,dz). 

J[0,t] J[0,t]xR 

To proceed with the proof of Theorem 2.1, we need the following lemma. 
4.2. Statement and proof of Lemma 4.1 

Lemma 4.1. Let N be a Poisson measure with intensity measure "/i® Q and N2.n as 
in (4-5). Suppose that (2.2) holds. If Qn tends to Q in total variation as n— >cxj, then 
\im^^^\\C{N2,n)-CiN)\\=0. 
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Proof. Suppose that (2.2) is satisfied for n S N, p„ G (0,1) and 7 G (0,cx)). Let 
Bn^i-, ■ ■ ■ , Bn^n be independent Bernoulli variables with parameter p„. Suppose that 
iUk,Cn,k)k&m is an i.i.d. sequence of random vectors with independent components, 
where Uk is uniformly distributed on (0, 1) and C{(^n.k) ~ Qn- Suppose that Bn.i, • ■ • , Bn.n 
and (C/fc,Cn,fc)fceN are independent. Observe that 



C{N2,n) = ciY,Bn,ke 



(v„,fc,c„,fc) 



\k=l 



with Vn.k = (fc - 1 + Uk)/n for all n G N and 1 < fc < n. 

Let Nn be a Poisson measure on [0, 1] (g) R with intensity measure npni O Qn and define 



iV„,fc(B)=iV„ Bn 



fc-1 fc 



Be6([0,i]xM). 



Nn^i, ■ ■ ■ ,Nn.n are then independent Poisson point processes, where for all n G N, 
1 < fc < n, A'ji^fc has intensity measure 



npn[£(g>Qn]iBn 



fc-1 fc 



Bg6([0,1]xM). 



By the monotonicity theorem for Markov kernels (see [26], Lemma 1.4.2(i)), observe that 
for all n G N, 



||£(iV2,„)-£(iV„)||< 



fc=i fc=i 



(4.7) 



Denote the Hellinger distance between two probability measures Pi and P2 by iJ(Pi, ^2)- 
This gives us the following upper bound (see [26], Section 1.3, equation (1.23), and 
Section 1.3, equation (1.25)): 



)C{Bn,k£(V„^^,U,k))- 



\C{Nn 



k=l 



fc=l 



\fc=l k=l / 



<{'J2H\C{Bn,keiy^,,,U^,))X{Nr,,k)) 

\k=l / 



1/2 



(4.8) 



Fix ri G N and 1 < fc < n. Let (Ki.fc.i, Cn,fc,0'eN be an i.i.d. sequence of random vectors 
with C{yn,k,iXn,k.i) = 'C(14,fc) ® Qn, I G N. Supposc that (14,fc,i) is independent of Bn,k 
and T„_fe, where Tn^k is a Poisson variable with parameter p„. We then have the following 
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identities: 

By [26], Lemma 1.4.2(ii), for n G N and 1 < A: < n, we must have 

HiC{Bn,kS^v„.,.z^.,)),C{Nr,,k)) < H{C{Br,^k),C{T„M)). (4.9) 

As H{C{Bn,k),C{Tn,k)) < 3^/^ (see [26], Theorem 1.3.1(ii)), it follows from (4.7)-(4.9) 
and (2.2) that 

limsup ||£(^„) - C{Nn)\\ < limsup(3?ip,2j^/2 ^ q (4 ^q) 

In view of a well-known upper bound of the laws of Poisson point measures in terms 
of the corresponding intensity measures (sec [26], Section 3.2, equation (3.8)), it follows 
from (2.2) and \\i®Q\\ = 1 that 

||£(iV„) - CiN)\\ < 3||7£® Q - npJ^QJ 

< 3|np„ — 7I + 3np„||Q — Q„|| ^0 as n — )■ 00. 

By means of (4.10) and (4.11), this completes the proof of the lemma. D 

4.3. Proof of Theorem 2.1 (part II) 

Let A^ be a Poisson measure with intensity measure 'yi(E)Q.lt follows from (2.5) and (4.6) 
that there exists a family of deterministic Markov kernels Kg : Mi xVi —>■ [0, 1], indexed 
by e e [0, oo)4, such that both Ce{G2,n) = KeC{N2,n) and Ce{G) = KeC{N) for all n e N 
and 6' G 8. Since we have assumed (2.2), the assertion of Lemma 4.1 holds and we thus get 
from (A. 4) and the monotonicity theorem for Markov kernels (see [26], Lemma 1.4.2(i)) 
that as n — >■ cx), 

A(£,£*J< sup ||£e(G)-/:,(G2,„)||<||/:(A^)-£(A^2,n)|H0. 

Consequently, £"2 ji converges (strongly) to £ in deficiency as n — > 00. Recall that £ (o) 
is equivalent to £^^ in deficiency for all n S N. To complete the proof of the theorem, it 
thus suffices to show that £{ „ is equivalent to £3 n • 

Therefore, let Mq denote the space of all non-negative point measures on [0, 1] with 
finite support. We equip this space with the cr-algebra AAq generated by point evaluations 
(see Reiss [26], pages 5-6). Let Mo.i C Mo be the subset of point measures a € Mq such 
that there exist 777 G N and = fo < ii < ■ ■ • < ^m < 1 with a = X^fcLi ^tk- For a G Mq, 
wc define mappings Ti^o.,T'2,ct : [0, 1] -^ [0,oo) and r3_(^,T4^cr : [0, 1] x R ^ [O,cxo) x R as 
follows: if cr G Mo\Mo,i, then for aU t G [0, 1] and x G R^ we set Ti^„{t) = T2,^(i) = t 
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and T3_cr{t,x) = T4^a-{t,x) = {t,x). Otherwise, if cr S Mo.i, then there exist m G N and 
= io < ti < • • • < t„i < 1 with a = X^fcLi £*& ^-^d '^6 set 

TiAt)^ u'*!^ — T + -' te[tk-utk), l<k<m, 

m[tk-tk-i) m 

TiAt)^ ,!~^7 V + 1' ie[tm,i]. 

In this case, we define T4,^:[0,l] x M ^ [0,1] x M by T^.^ = {Ti,„{t),x). Then, 
Ti,^ : [0, i™] ^ [0, 1] and T^^c ■ [0, i^] x M ^ [0, 1] x R are bijections and we let Ta,^ : [0, 1] -^ 
[0,i,„] and Ta^o- : [0, 1] x M — >■ [0,im] x M be their corresponding inverses. 
Let n e N. Recah (4.4) and set 

n 

Afi,„ = 2^ey„,fclGi,„(fe/ri)-Gi,„((fe-l)/ri)#0, 
fe=l 

A^2,n= 2J '^([t"] + l)/«lG2,„(t)-G2,,.(t-)#0- 
0<t<l 

For n e N and i = 1, 2, it follows from the transformation theorem that 

J[0,t]xE 



hi.n°T^,Mi„it) = ho+ /3 -a{h,,„oT,,M,„){s-)dTi^Mi„is) (4-11) 

i[o,t] 

+ A / (/i,,„oT,_A/,,„)(s-)2^A'"„'^^''"''"(ds,dz) 

J[0,t]xR 

for alHe [0,1] and 0= {ho,P,a,j) £ [0,cx))'*. 

Let 61 = {ho, P, a, 7) G [0, cx))"^. If /iq = /3 = 0, then it follows from (4.2), (4.6) and (4.11) 
that hi^n ~ hi^n ° Ti^Mi „ = 0, z = 1, 2, a.s. and thus 

Ce{Gi^n) ^Ce{Gi^n°Ti^M,,,J =£0, neN,i^l,2. 

Otherwise, if /iq +/3 > 0, it follows from (4.2) and (4.6) that ft,i,„(i) > for ah t € (0, 1] a.s., 

1 = 1,2. In this case, we have Mi^„ = N2.n, ^h^n = Ni,n, ^i."^""''" = ^n,2 and N^^n'^'" = 
Nn^i and thus we get from (4.11) that both 

^e{Gi^n) = ^e{G2,,i°T2,M2.n) and £e(G2,„) =-Ce(Gi,„ oTi_a/j_^) 

for n G N. In other words, for all n G N, there are Markov kernels Ki2.n : I?i x 2?i — > [0, 1] 
and K2,i.n :-Di x 2?i ^ [0, 1], not depending on 9 £ [0,00)"*, such that Ki^2,nC-e{G2,n) = 
^e{Gi^n) and K2,i.n^e{Gi^n) = ^e{G2,n) for all 0£ [0,oo)^. Hence, f^^ is equivalent 
to £2 n ill deficiency by (A. 2) for all n G N. This completes the proof of the theorem. 
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5. Proof of Theorem 2.2 

The proof of Theorem 2.2 is spht into two parts, reported in Sections 5.1 and 5.4, re- 
spectively. We will need two additional results, which appear as Lemmas 5.1 and 5.2, 
together with their respective proofs in Sections 5.2 and 5.3. 

5.1. Proof of Theorem 2.2 (part I) 

Recall that Le Cam's distance is a pseudo-metric. In view of (A. 4) and Theorem 2.1, it 
thus suffices to show (2.15). For n eN, let Z„ = {Z„.k)i<k<n be a random vector with 
a distribution as in (2.1). 
First, we assume that 

Qn = Q, neN. (5.1) 

At the end of the proof we will relax this condition to ||Q„ — Q\\ — >0 as n— >oo. 

Let Nn be as in (4.1) and set \\Nn\\ = N„{[0, 1] x M), n e N. Let 9 be as in the assertion 
of the theorem. Suppose that Hin — Hn = (^o,i,ni/?i,n,Qfi^„, Ai^„) :0 — ?> [0,oo)^ satisfies 
the assumptions of the theorem. Further, let i?2.n = {ho,2,n,l32,n,ct2,m A2,n) = Hn : — > 
[0,00)'' be defined by the identities in (2.11) and (2.12). 

For 6* e e and i = 1, 2, we define Xi^n = {Xi^n{k))i<k<n by 

X^Ak) - <{f (fc - l)Zn,k, ^z,n(0) = 0, 

Kn{k) = A.n(^) + Kn{k ~ l)[a,,„(0) + A,,„(0)Z2_J, (5.2) 

KrM - ho,rM0), neN,l<k<n. 

Hence, Xi_„ corresponds to the GARCH processes Gn as in the theorem, and X2,n to 
the GARCH processes G„ defined directly after (2.12). Let 



< cr = (cr;)l<;<fc £ N'= : ^ cr, < n i , 



Mn,k^<<J = {ai)i<i<keN'':22<^i<n>, l<k<n,neN. (5.3) 

By employing the conventions 0° = 1 and X)™ fc ^ ^ for tti < fc, we set 

m=0 

^..n,3,^<.(^) = A,-„(0)[a,,„(0)]"'+i-i 
for a = {ai)i<i<k G Mn^k, I < k <n, <l < k ~ 1, i = l,2 and n G N. 
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Also, we recursively define functions from R'^ — > R by setting 



(Ti-2 



m=0 

(5.5) 

gi,n,l,a,eiy) = m,nS,U<i{d) + m,n,2,l,'y{d)gi,nd-lM,e{y) + 'ni.n,3U,a{d)yf 

for j/eR'', a^{ai)i<i<k & M„^k, 1 < fc < n, < / < fc - 1, i = 1,2 and n e N. 

Let n G N and 1 <k <n. On {|| A^n|| = fc}, we consider the following stopping times: 

To=0, Tm = mm{iy e {T„,-i + l,...,n}:Z„^„^0}, 1 < m < fc. 

Using these stopping times, let At ~ ((AT,„)i<m<fc) £ Mn^k be the random vector defined 
componentwise by Ar™ = r™ — t^-i for 1 <m < k. 

Let i = 1, 2, n g N, l<k<n and 6* e 9. On {||iV„|| = 0}, set Yi^n = 0, and otherwise, 

^i,n = (^i,n(0)l<i<||A'„|| — {^i,n{n))l<l<\\N„\\- (5-6) 

In the notation of (5.4) and (5.5), li_„ satisfies the following recursion on {||iV„|| = fc}: 

^^,n(0=.9,>G-l)^n,r,, 1^:,„(0)=0, 1 < / < fc, 

5i,n(0 = Vi:n,l,LATi9) +gj,ri('- l)??i,n,2,;,Ar (^) 



+ Vi.n,3J,Ar{(^)gt,n{l- ^) Z^ r, ^ ^ < I < k - 1, 



(5.7) 

1 <- ; <r I. _ 1 

ffi,ri(0) = ,9i,,i,0,Ar,e- 

Recall (5.3). For aU n e N, 1 < fc < n and cr = (fT;)i<;<fc G M„^fe, let 

An,k^, = {\\Nj=k,AT = a}. (5.8) 

For future purposes, we collect some useful inequalities in the next lemma. 

5.2. Statement and proof of Lemma 5.1 

Lemma 5.1. Suppose that {&,{H„)nefi) satisfies the assumptions of Theorem 2.2. Let 
S € (0,cxj) and suppose that Q([—S,S]) = 1. 

There then exist some C = C{S, 8) G (1, oo) and uq = nQ{S, O) G N such that the fol- 
lowing three inequalities hold: 

C 

\9l,n,0,a,e — ff2,n,0,CT,e| < — , (5.9) 

n 

kn,iMy)>c-\ (5.10) 

Ee[\gi,n,l,a,e{YlA - g2,n,l,cr.e{Yl,n)\\A„.k.a] < — (5-11) 

n 
for all n> no, 1 <k <n, <l <k - I, a e MnM, « = 1,2, G 8, y G R*-' and i = 1,2. 
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Proof. Let (9, {Hn)ne.n) be as in Theorem 2.2. First, note that (6, (-ffi,ra)„GN) = 
(6, {Hn)ne.n) satisfies the assumptions of Theorem 2.2. Also, recall that (0, {H2,n)nen) = 
(6, {Hn )rieN) is defined in (2.11) and (2.12). In particular, observe that ai^„(0) -^ 1 uni- 
formly for all B = (/iQi /3, a, A) € 6 as n — )• od, i = 1, 2, and thus there is an n\ = ni(0) e N 
satisfying 

e-i 

^- < [a.,«(6')]" = cxp(nlog[n + n(a,,„(6l) - 1)] - nlogn) < 2c (5.12) 

for all n > ni, i = l,2 and 9 = {ho, (3, a, A) £ Q. 

It follows from our assumptions on (0, (-ff„)„eN) that there exist ng = "■o(Q) ^ 't-i and 
Ci = Ci(e) G (1, oo) such that 



2 ai,„(6/) 



e 



-1 



(5.13) 



> —r inf _/io > C*! 



-1 



4 (/io,/J,Q,A)ee 



and 



yeR.'' 



max(/io.,,„(0),/3..„(0), K„(0)]", ^^2i^^l < Ci, 
max{| Vi,«(^) - ho,2AS)\, |A,„(0) - /32,„(e)|, (5.14) 

|ai,„(e) - a2.„(e)|, |Ai,„(0) - A2,„(0)|} < — 
for all n > uq, 1 < k < n, < I < k — 1, a ^ icri)i<i<k G -^^»i,fe, i = 1,2, G 6 and 
Recall (5.4) and (5.5). It follows from (5.14) that wc have 

ma.x{l']i,n^2,l,aiO),rii.n,3,l,aiO)} < Cf, 

rRa,yi{rji^n.l,l,aiO),gi,n,0,afi} <{k + l)Ci, 

max{|[ai,„(0)]™ - [«i,„(0)]"|} < ^, (5.15) 

max{|77i,„,,,,,,(0)-772,n,j,;,a(^)|:j = l,2,3}<^ — ^^, 

\gi.n,0,a,e — 92,n,0,cr,e\ < 



rj 



for all n > uq, l<k<n,0<l<k-l,a= {(7i)i<i<k G Mn^k, i = 1, 2, m G No and 9 eO. 

Recall (5.7). Let S" > 1 be such that Q([-S',S']) = 1 and set C2 = C2{S,9) = e2(l + 

S')^Cf and C3 = 03(8, 9) = S^C2- It follows from an induction and the inequalities 
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in (5.15) that 

Ee[g^AlMn,k,a] < Cf{k + 1) + Cl{l + S'')Eg[g„,{l - l)\A,,^kA 



(5.16) 



< (fc + 1) ^ (1 + 52ym^2(l+m) ^ ^fe 
Tn=0 



and thus that 

Eg[Y^^^{l)\A„^uA<C^z (5.17) 

for all n > no, l<fc<n, 0</<fc~l, cr = (ci)i<'<fc ^ ^^n.fci J = 1,2 and e 8. 

Finally, let C = C(S', 0) = Uc^C^Cz. By an induction, it follows from (5.15)~(5.17) that 

Ee[\gi,n,l,a,e{Yl,n) — gi,n,l,afi^\,ii)\\An,k,o\ 
3 

< C'a X! l^i'"'J''(^) ~ '^2,n,j,;(6')| + £'e[|.gi,„,;-i.a,e(yi,n) - 52,n,;-i,^,e(>l,«)P«,A:,rT] 

fc-l 3 „j, 

< |ffl,n,O,<T,e-.92,n,O,(T,0| + C*! ^ ^ |ryi,„.j,; (6*) -r/2,„jv(6')| < — 

(=1 i=l 

for all n > tiq, 1 < /s < n, < / < fc — 1, u = (cr;)i</<fc € Ai„^fc and 6 £ 0. This completes 
the proof in view of (5.13) and (5.15). D 

5.3. Statement and proof of Lemma 5.2 

We now provide an upper bound for conditional laws and their total variation norms in 
the next lemma. 

Lemma 5.2. Suppose that Q adm,its a Lehesgue density f , where f is globally Lipschitz 
and has a compact support {/ > 0}. 

// (0,(-ff„)„gN) satisfies the assumptions of Theorem 2.2, then there exist uq = 
no(/, e)eN andC = c\f, 8) G (0, oo) such that 

\\llQ{YxAAn,k.a) - Ce{Y2AJ^nM.o)\\ < — (5-18) 

n 
for all 6 ^Q, n> hq, 1 <k<n and a G Mn,k- 

Proof. By assumption, we have f{x) = for all \x\ > S and some S > 0. Hence, there 
are no = n^{f,9) G N and Ci = Ci{f,9) G (l,oo) such that for C replaced by Ci, the 
assertion of Lemma 5.1 holds. 

Let n > riQ, i = 1,2, 6 £ Q, 1 < k <n and a G Mn,k- Recall (5.5). In view of (5.10), 
^i^n,e'-^'' — >■ K*' is a well defined C°° diffeomorphism, where ^i^n,a,e = {'4'i,n,i,a,e)'^<i<k ■ 
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is defined by 



1pi,n,l,cr,e{,y) 



yi 



.1/2 , s 

9i,n,l-l,afi\V) 



(5.19) 



for 2/ = (j/i, . . . , j/fc) e M'^ and 1 < Z < fc. For all n > no, 6 £ Q, n> no, 1 < fc < n and 
cr G Mn^k, we define 



fi,n,k,crM{y) =11 ~n" 



fi'>p2,n,l,a,diy)) 



1=1 9i,n,l-l,a.0yy) 



yeR'',i = l,2. 



It follows from (5.5), (5.7) and (5.19) that fi^n,k,<y,e is a density of the probability measure 
C-e{yi,n\An,k,a) with respect to the Lebesgue measure H®'' on S(]R'^). In particular, we 
must have 



\(^eiXl.n\An,k,a) — -^6/ (^2,™ |^n,fc,, 



\Il,n,k,a,8{y) - f2,n,k,c7,eiy)\dy (5.20) 



for all e 9, n > ng, 1 < A; < n and a £ Mn,k- 

Suppose that C/ G (0,C!o) is a global Lipschitz constant of /. By means of simple 
substitutions, for all e > and WjV > e, we can observe that 



f{x/v) f{x/w) 



dx<-{S^Cf + l)\v-w\. 



V w 

Consequently, for all e > 0, we find a ki = ki(/, e) G (1, oo) such that 



fix/v) f{x/w) 



dx < Ki{e)\v — w\, v,w>e. 



In view of (5.10), there thus exists some K2 = K2(/, ©) G (l,oo) such that 

<iyi 



fiyi/9i!n,i-i,aAyy^ /(yi/ff2,n,/-i,^,e(y)) 



5l.n ;-l,cr,8(y) 



92.nA~l,a,e\y) 



< fi2\gi,n,l-l,a,e{y) - g2,n,l-l,c,Av)\ 



(5.21) 



for all n > no, l<fc<n, l</<fc, crG M„.k, y G K'^ and G O. By integrating over yk, 
we get from (5.21) that 



1 



\fi,n,k,a,e{y) - f2.n,k,c7,eiy)\dy 



< K2 



fe-1 



f{-ipl,n,l,a,e{y)) 



TT — 

'-' 1=1 9l,n,l-l,aAy) 



/2 



\gi,n,k-i,a,eiy) - g2,n,k-i,a,e{y)\ dy (5.22) 
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dy 



n f{lpl,n.l,a.e{y)) TT f {lp2,n.l,a.e{y)) 
-1/2 / N 11 -1/2 / \ 

1=1 9l^n,l-l,aM\V> 1=1 92,n,l-l,a,eyy) 



(5.23) 



for all n > no, 1 < k <n, a E A/„_fc and 6 E Q. It follows from (5.11) that 
/ 11 ~n2 -— l5i,",fc-i,a,e(y) - g2.n,k~i,a,d[y)\dy 

= Eo[\gi,n,k-l.e{Yl,n) — g2.n,k-l.e{Yl^n)\\An,k,a] < 

n 

for all n > no, 1 < k <n, a E Mn^k and G 9. 

Let C = eK2Ci. By inductfon, wc thus get from (5.9) and (5.22)-(5.23) that 

\\Ce{Yi,^\An.k,a) - Cg{Y2.n\A^,k.a)\\ < — , 

n 

uniformly for all n>nQ,l<k<n,a£Mn.k and OeQ. This completes the proof of 
the lemma. D 

5.4. Proof of Theorem 2.2 (part II) 

Let / be a Lebesgue density of Q and Q be as in Theorem 2.2. We denote the positive 
part of a function g : R ^- M by 5+ . Let C^ be the space of infinitely often continuously 
differentiable functions g ; M ^ K with compact support {g > 0}. As C^ is dense in L^, 
we find a sequence of gm G Cq' , m G N, such that / \gr,i ~ f\d£ ^i' as m — >■ 00. It is 
immediate that both J \g^ — f\d£ —>■ and J g^d£ —?' 1 as tti ^- 00. Without loss of 
generality, we may thus assume that J 3+ d^ > for all to e N. Then, h,n ;= 3^/ / g^ d£ 
defines a sequence of globally Lipschitz continuous probability densities with a compact 
support {/i,„ > 0} such that / \h„i — f\d£—> 0. 

For TO € N, let Z„ = (^„™ )i<fe<n be a random vector with distribution 



£(4'"))(B) = ("(1 -p„)£o(S) +p„y /i„d£ 



with B e S(R"), TO,n e N, 1 < fc < n. If we replace Z„,fc by z';''^} in (5.2), then we get 

yet another family of GARCH models, X^^' = {Xl"^'{k))i<k<n, say, indexed by 6* e 0, 
i = 1, 2 and m, n £ N. 

It follows from the monotonicity theorem for Markov kernels and a well-known upper 
bound for product measures (see [26], Lemma 1.4.2(i) and page 23) that for all i = 1,2, 

sup\\CeiX,,,)-Co{X^/;^)\\<\\C{Z.n)-C{Zir^)\\ 
eeeo 

. (5.24) 

< niJ£(Z„,i) - £(zi';j))|| = ^ / \K, - /I dt 
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As h,n is globally Lipschitz with a compact support {h,n > 0} for all m G Nq, the assump- 
tions of Lemma 5.2 hold. For all m S N, there thus exist rim G N and Cm = (^{hm,®) € 
(0,c») such that for all n > Um, we get, by conditioning and the monotonicity theorem 
for Markov kernels, that 

sup||£,(x(;::') - ce{xif)\\ < -i?[cr"ii] (5.25) 

see n 

for A^„ as defined in (4.1). Recall (5.2). By combining (5.24) and (5.25), we get, from the 
triangle inequality, that 



sup I 
see 



,|l£e(G„) - £fl(Gi°))|| < np„ / \hm - f\ d£+ ispr-ll] 

for all 771 G N and n > Um- As (2.2) holds, we have lim„_j.oo ECm " = e^*^*-^™"^^ and thus 
limsupsup||/:e(G„)-£e(G(°))||<Alimsup /" |/i„ - /| d^ = 0, 

giving (2.15). This completes the proof of Theorem 2.2 in the case where Qn = Q for all 
neN (see (5.1)). 

Now, assume that Qn — > Q in total variation norm as n — >■ oo. For to € N, let Z„ = 
iZn,k)i<k<n bc a raudom vector with distribution 

C{Z„) = {{l-Pn)eo +p„Q„)®", n G N. 

If we replace Zn^k by Z„_fc in (5.2), then we get the GARCH models in the assertion of 
the theorem. We denote them by Xi.m ?^ € N, i = 1, 2. By the same argument as in (5.24), 
we must have, for all 1 = 1,2 and n G N , 

sup WCeiX^^n) - Cg{X^^n)\\ < npn\\Qn - Q\\- 

eeso 
As the right-hand side tends to zero, this completes the proof of the theorem. 

6. Proof of Theorem 2.3 

We first need to make some preparations. Let Z — (Zn)neN and U = {U„)n<£N be indepen- 
dent sequences of i.i.d. random variables such that C{Zi) = Q with Lebesgue density / 
and Ui is uniformly distributed on (0, 1). For d G N, we denote the order statistics of 
0, C/i, . . . , f/d by =: Ud,o < Ud,i <■■■ < UdM- For each n G N, let i/„ be a Poisson random 
variable with parameter 7„ > 0, independent of Z and U. 

In both (2.4) and (2.5), N admits a representation N = J2k=i^iu,, k,Zk) since N is 
a Poisson measure with intensity 7„£(8) Q. On {f„ = 0}, we let AU,y^ = AG^^ = AG^^ = 0, 
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whereas on {vn > 0}, we set 

AG.„ = (G(C/.„,fc) -G([/.„,fe-))i<,<,^, 
AG.„ == (G(L/.„,fe) -G(C/.,.,fc-))i<,<,^. 

Let ^o = IRO = {0} and M = U^oi'^} x 'S'd x M'*, where for d G N, S'rf equals the set of aU 
w = {wi, . . . jWd)' € (0, l)** such that J2i=i ""^i ^ 1- We endow Sd and M with the Borel 
trace field B{Sd) {d > 0) and the cr-algcbra B, respectively, where B is the set of all S C R 
such that B n ({d} x Sa x W^) € {0, {d}} ® B{Sd) ® B{R'') for aU d e Nq. 

Since wc have assumed that Q C (0,oo) x [0,oo)'^, and since G and G always jump 
at the same time as N does, all arrival times are observed in full and thus £-y„,Q{Q) 
and f7„,Q(B) are equivalent to J-"„ and J-"„ in deficiency, respectively, in view of (A. 2), 
where for all n S N, we set 

J-„ = (R,6,(/:0K, AC/,„,AG,J)eee), 
Tn - (i,S,(£eK, A[/,„,AG,J)eee). 

Let wo = and for d>0, set Wd = (1/rf, . • • , 1/d) G M''. RecaU that 6 C (0, oo) x [0, oo)^ 
and pick dG No, ~ {ho,f3,a,X) &Q, w — {wi, . . . ,Wd) G 5*^ U {wd}- We define a diffco- 
morphism '^d.w.6 : M'^ ^- M'' as follows: if d = 0, then let '^d.w.e = 0; otherwise, if d > 0, let 

'i'dAu,e{z) ^ {hd,wM.ki^)^k)i<k<d, z^{zi,...,Zd)& M'', 
where for 2 <k < d, we recursively define 

hd,n,,0Az) = -(1 - e-"""^) + e-^'-Hl + ^4-i)hd.n,.e.k^i{z), 

" (6.1) 

hd.^,,9^{z) = hd.n,.0A = -(1 - C-"'"^ + C-^'^^/iO, 

a 
provided a > and otherwise, if a = 0, we set 

hdfi,w.k{z)^ (iwk + hd.e.w,k-i{z){l + Xzl_i), 

(6.2) 
hd,e,w,i{z) = hd,w,e,i = Pwi + h^. 

Let / be a strictly positive Lebesgue density of Q and set 

(|J . (a:)|/®'^(*,-.L.,,(x)))f(|J^ . (x)|/«'^(*-]„...(^)))'"'d.T 

for all 6*1,612 GO, 0<C<1, wG^dUJWd}. 

To summarize, we have thus far shown that for all n G N, equivalence of ip7„,Q(6) and 
£i„,q{Q) in deficiency is equivalent to equivalence of Fn and Fn in deficiency. For the 
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remaining part, recall that the two experiments are equivalent in deficiency if and only if 
their corresponding Hellinger transformations are equal (see [27], Corollary 53.8). By solv- 
ing the differential equations in (2.4) and (2.5), we thus arrive at the following identity: 

°° d —-y °° d —■y r J 



'Hd,eifi2,'wA0 = / , ^^ — / 'Hd,ei,92,wiC) 



for all 6*1, 6*2 e e, < C < 1, n e N. 

In the last display, the functions are analytical in 7„ ; consequently, for all d G N, 
01, 02 € 0, < C < 1, we must have 

'H-d,eu02.iSa{C) = '^d,ei,e2,j«(C)-7^T77rT- (6-3) 

Jsa (^^"(^d) 

Next, we return to the proof of the theorem. By our assumption, there exists Co G (0, 1) 
such that, with gf^c^^ : (0,00) — > [0, 1] as in (2.18), gf^o is strictly increasing on (0, 1]. As 
a result, hi—>- g/Xoi^h) is strictly increasing on (0, 1]. 

For aU 6*1, 6*2 G 6, define ifgi.Sa : (0, 1] -> (0,oo) by He^fi2{w) := ft.i,u;,e2,i(l)/^i,i",ei,i(l) 
for < w < 1. In particular, taking d~\ and C = Co in (6.3), wc must have 

5/,Co{v ^61,62(1)}= / 9f,c«{jHg^^gAw)}dw (6.4) 

"'(04) * 

for all 6*1, 6I2 GO. 

(i) and (ii) For i = 1, 2, let 9i = {ho,i,f3i, a, A) G 9. Then, 

hi „ e, i(l)e""'-^i?ei,e,(^«) = /^s Vi " A/io,2, 0<w<l. 
' ' aw 

(Note that this formula extends to a = 0.) If /3i = /32 > and /iq.i > /10.2, then Hg-^g^ is 
strictly increasing with Hg-^^g.^{l) < 1, contradicting (6.4) since ht-^ gfXg{^/h) is strictly 
increasing on (0,1]. If /io,i = ^0,2 and /32 < /3i, then Hg^^g^ is strictly decreasing with 
^01.92(0+) = 1, contradicting (6.4) since hi-^ gfXg{\/h) is strictly increasing on (0,1]. 
Reversing the roles of parameters by replacing Hg-^^g^ with Hg^g-^ , the previous reasoning 
extends to the remaining cases where either /3i = /32 > and /io,i < ft.0,2, or /lo.i = /io,2 
and 132 > Pi- This completes the proof of (i) and (ii). 

(iii) If (/io,/3,ai,A),(/io,/3,a2,A) G 9 and /3 = 0, then we have Hg^.g^w) =ci^i-o^2)w 
for all w G (0,1]. (Note that this formula extends to ai = or a2 = 0.) By the same 
arguments as in parts (i) and (ii), we get from (6.4) that ai= a2- 

(iv) In view of (iii), wc may assume that /? > 0. Contradicting the hypothesis, we 
assume that a2 > 0. It follows from the strict inequality c^ — 1 > a;, .x > 0, that 

{ho + Pwf4-Hg,,g2{w) = e-"^^"(«;(/3^ - aa/S/^o) - /^g«2 - — (e° 



du; "^'"'^ ' t '^ ^" "' "^ aa 

<-a2/ioe""''"(/io+w/3)<0 

for all w G (0, 1]. Thus, w H^ Hg^^g^w) is strictly decreasing on (0, 1] with Hg^_gAQ+) = 1, 
contradicting (6.4). Thus, we must have a2 = 0. 
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(v) Let (/io,/3,ai, A), (/io,/3,a2,A) e Q with a2 > ai. Without loss of generality, we 
may assume that /3 > 0. First, assume that /3/a2 ^ ho < /3/ai. Then, /3 — aiho > and 
l3 — a2hQ < 0. Note that wc cannot simultaneously have that /3 — aihg = /? — 02/10 = 
such that 



^i,w,,ei,i3— ^Si.eaCw) = {l3-a2ho)e 
aw 



'/ii,t«,ei,i -W- aiho)e "''"/ii,„,,e2,i < 



for all < ui < 1. Consequently, He-^^g^ is strictly decreasing with Ho^,o^{Q+) ~ 1, contra- 
dicting (6.4). Second, let /iq < P/a^ and set 

i:{w) := (/? - a2ho)hi,.^,e,,i - (/3 - ai/io)c-'"^-"^)"'/ii,^,e,,i, < w < 1. 

As we have a2 > ai and Hq < /3/a2, we must have that /3 ^ aiho > /3 ^ a2hQ > such that 

^'(w) = (ai - a2)(/3 - ai/io)e-'"^-"^'"/id,»,e.,i < 0, < «; < 1. 

Note that ?A(0+) = (ai - a2)/io < ^^'^ tl™s V(w) < for all < w; < 1. Since 
ga2«'/j2 ^ ^ ^^^g^ g_^(-y;) = ^(i/;) < for all < It; < 1, Hg^,g^ is strictly decreasing with 
^61,612(0+) = 1; contradicting (6.4). This completes the proof of (v). 

7. Proofs of the results in Section 2.4 

This section contains the proofs of Propositions 2.1-2.3. 

7.1. Proof of Proposition 2.1 

For / e Dd[0, 1], we write A/ = f(t) - /(t-), < t < 1, with the convention that A/(0) = 
0. With the usual convention inf = oo, we define T{f) = inf{i € [0, 1] : A/i(i) 7^ 0} A 1 for 
all / = (/i, /2) G D2. Let S be the set of all functions f € D2 with T(/) e (0, 1). Let L*^' C 
D2[0, 1] be the set of all functions / = (/i,/2) such that the right-hand derivatives /((0-I-) 
and /2'(0-t-) exist in K. Further, let D'^j, CSnD'f^ be the set of all functions / = (/i, /2) 
such that the right-hand derivatives fi{T{f)+) and f2{T{f)+) exist in R. 
Let f€D2 with T = T{f). If / e L>^' j, and /2(0+) ^ 0, then we set 



Xif) = 

^(/)=(l/2 



1/2(0)1, 



(/^(0+))^-./2(0)/^'(0- 



/^(0+) 



/^'(o- 



and /:^(r+) 7^ 0, then we set 
imT+)?-f2iT)f^'iT+) 



f^{T+) 






m^ 



|A/2(T)| 
(A/i(T))2 

|A/2(r)| 
'(A/i(r))^ 



If / e D'^j,, /'(0+) = /2(7'+) = and A/2(T) ^ 0, then we set 

|A/2(T)| 
(A/i(T))2 



^(/)= 1/2(0)1,0,0, 
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If / e D'^^, /' (0+) = /^(r+) = and A/2(r) = 0, then we set X{f) = (|/2(0)|,oo,c3o,0). 
If / € Dli\S and f^{0+) ^ 0, then we define 



^(/)= 1/2(0)1 



(/2(0+))'-/2(0)/^'(0+) 



m- 



/2'(0+) 



/^(o- 



For the remaining cases, we set X(/) = (|/2(0)|,oo,(X),cx)). Then, X:Di^ [0,oo]'^ 
is a X'2-'B([0,oo]'')-measurable mapping. Since (5({0}) = 0, it fohows from (2.4) that 
l-e{{G,h)) = Qe for all 9 e [0,00)^ and thus (5(5^, J") = by (A.2), where T is the ex- 
periment as defined in the assertion of the proposition. 

Next, wc show that 6{F,£h) =0. To this end, wc define ^= (^1, . . . ,^3) : [0,oo]* — )■ 
[0,00)'^ as foUows. Let cj = (wi, . . . ,aj4) G [0,oo]'*. If (wi, . . . ,^3) G [0,oo)'^, then wc set 
^(w) = (wi, 0,12,^3); if wi G [0,(X)) and cither L02 ^ 00 ov lo^ — 00, then wc set ^(w) = 
(aji,0,0); otherwise, we set ^(w) =0. 

In the notation of the Introduction, we define ^ : [0,oo)^ x M2 — > -D2, where, for < 
i< 1, w = (wi,a;2,a;3) G [0,cxd)3 and cr G M2, (/i(i),/2(0) = *['^,o'](0 is defined to be 
the unique solution of the system of the following integral equations: 

/ 1 /? 

/i(i)= / /2 (s-)2:icr(d.s,dzi,dz2), 

Jf0.tlxR2 

(7.1) 

f2{t)=UJi+ (W2 -W3/2(s-))ds+ / /2(s-)2:|CT(ds,dzi,dz2)- 

J[0,t] J[0,t]xRx(0,oo) 

Clearly, ^ is (,B ( [0, 00 ) '^ ) OA^ 2) /2?2 -measurable and thus defines a deterministic Markov 
kernel K2 : ([0,oo)3 x M2) x I?2 ^ [0, 1]. 

Let i/Q be the zero measure on B{[0, 1] x M^). For A > 0, let M\ be a Poisson measure on 
[0, 1] x R^ with the intensity measure '-^i® C{Z, A^/^Z), where C{Z) = Q and 7 > is the 
intensity parameter of N in (2.4). Consider the Markov kernel Ki : [0, 00]^ x (S([0, 00)'^) ® 
7W2) ^ [0, 1] defined by 

i^i[(a.i,c.2,W3,W4),-] =£CM ® |£(Af^jAf^, ^^.o), C.4 <oo. 

Observe that K2K1Q9 = /^^(G', h) for aU 6* G [0, oo)^, in view of (2.4). Hence, <5(J', £,,) = 0, 
by (A.2). 

To summarize, wc have shown that £h is equivalent to T in deficiency. By means of 
similar arguments, we can show that A(T,£h) = 0. 

7.2. Proof of Proposition 2.2 

(i) Let Hn = -ffi°^ : [0, oo)^ -^ [0, oo)^ be as defined in (2.11)-(2.12) and define ij„ : [0, oo)^ - 
M:={{xi,X2,X3)€ [0,oo)2 X (0, 1] :a;i > xz} by 

Hniho,l3,a) = {ho^n{ho,P,a,0),l3n{ho,l3,a,0),a„{ho,l3,a,0)), 
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ho,P,aG [0,oo). Then, ff„ : [0, cxd)^ -> Tlf x [0,oo) and F„ : [0,oo)^ -)► M are both bijec- 
tions with inverse functions H~^ : M x [0, oo) — >■ [0, oo)* and H~^ : M — >• [0, oo)^, respec- 
tively. Define H^-.R^ ^ [0,oo)3 and F„:R4 ^ [0,cx))4 by F„(xi,X2,X3) = H-^{\xi\ V 
|a;2|,|a;2|,|a;3| A 1) and Hn{xi,X2,X3,X4:) ^ H~^{\xi\ V |a;2|, |a;2|, |a;3| A 1, |x4|) for xi,a;2, 
xa, a;4 € M with xa ^ 0. 

In the sequel, we write x = (a;(fc))o<fc<n for a generic element of M"+^. Fix n > 5. Let 
A^o,« C [R"+i]2 be the set of all {x,y) such that both y(0) 7^ j/(l) and y{l) ^ y(2). By 
employing the convention inf = cx), wc define Tn : [M"+"'^]^ ^{l,...,n + l} by 



r„(x,y) =inf{l < fc < n:a-(fc) ^a-(fc - 1)} A 1, {x,y) G 



pn+ll2 



Let S„ be the set of all {x,y) e [M"+i]2 with 3 < T{x,y) <n-2 such that x{T) = 
x{T + 1) = x{T + 2). Consider the subset Afr.n ^ S'„ of all {x,y) £ [M"+i]2 such that 
both y{T) ^ y{T + 1) and y{T + 1) ^ y{T + 2) are satisfied. 

For all n > 5, we define a mapping X„ : [R"+i]2 -j> [0, 00]'' as follows: fix {x, y) e [M"+i]2 
and set T = Tn{x, y). If {x, y) € SnC\ Mo,„, then set 

y{lf^y{Q)y{2) y{2) - y{l) 



Xn{x,y)^HAy{Q) 



2/(1) -y(0) '2/(l)-y(0)' 
y(T) y(l)2 - y(0)2/(2) + y{T - l)[y{2) - y{l)] 



[y(l)-y(0)][x(r)-x(T-l)]^ 



V{T) 



[x{T)~x{T-l)Y 

If {x,y) e AfT,n\Mo,n, then set 

Xn{x,y) 

^fr ( (r.. y{T+lf-y{T)y{T + 2) y{T + 2) - y{T + I) 

"^^^ ^' y{T+l)-y{T) ' y(r + l)-y(T) ' [x(T) - x(r - 1)]2 

_ yiT + lf~y{T)y{T + 2) + y{T-l)[y{T + 2)- y{T + 1)] ' 
[y{T+l)-y{T)MT)-x{T-l)]^ 

If (x, y) G S'„\(Mo,„ U A/t.„) and y{T) ^ y{T - 1), then set 

\y{T)-yiT-l)\ 



Xnix,y)=[y{0),0,0 



{x{T)-x{T~l)y 



If (x,2/) e 5„ \ (Afo,„ U A//t,„) and y(T) = 2/(T - 1), then set X„(a;,y) = (|2;(0)|,(X3,oo,0). 
If {x,y) e Afo.nX'S'n and T = n + 1, then set 



X„(x,y)= H, 



y(o) 



2/(1)2 -y(0)y(2) y(2)-j/(l) 



2/(1) -2/(0) 'y(l)-y(0) 
Otherwise, set X„(a;, 2/) = (|2/(0)|, 00, 00, 00). 



, 00 
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Recall that both G„ = {Gn,k)o<k<n and /i„ = (/i„,fc)o<fc<n are defined by (2.3) 
via (2.11) and (2.12). For n>5, the mapping X„:[M"+i]2 ^ [0,0©]^ is well defined 
and S([M"+i]2)/S([0,(X)]4)-nieasurable. Recall that Q„({0}) = for all neN and 
thus 

' 'Zl,ne(/io,;9,Q,oo) + q2,n£e 

+ (1 — qi,n — q2,n)S(ho,„(0),oc,ao,oc), ^ ©e, 

£f " {Gn,hn) = N (1 ^ 92, n)£(/io, 00,00,00) + <?2,n£0, 6* S Og, /iQ > 0, A > 0, 

(1 - 92,n)£(/io,oo,oo,oo) + 92,n£(/io,oo,oo,0) i ^ ^ ©c^O > 0, A = 0, 

^£(0,00,00,00)7 0e9e,/io = O 

for all n >5, 6 = (/io,/3,a, A) G [0,oo)^, where we set qi^n = (1 ^ Pn)" and (72, n = (1 ~ 

Pn)'[l -P« -Pn(l -P„)][l - (1 -Pn)"-"]. 

On the other hand, define a mapping ^„ = (^i.„, . . . ,^4_„) : [0,oo]^ — >■ [0,oo)^ as fol- 
lows. Let uj = (cji,. . .,a;4) G [0,oo]''. If w g [0,oo)^, then set ^n(a;) = Hn{uj). If w G 
[0,oo)^x {00}, then set ^„(w) = (/f„(wi, 0^2,^3), 0). If w e [0,oo) x ({00} x [0,oo]U [0,oo] x 
{00}) X [0,00), then set Cn(w) = (a;i,0, 1,0^4). If w € [0,oo) x ({00} x [0,oo] U [0,oo] x 
{od}) X {00}, then set ^„(a;) = (wi, 0,1,0). Otherwise, set ^(w) = 0. Define a Markov 
kernel ifi^n: [0,00]'' xe([0,cx3)3x [M"]2)^[0,l] by 

Ki^n[ui, ■] = £(C„.i("),C„,2(<^),C„,3(<^)) 

So, a;4 = oo, 

mZn,k)k, {U^)Zl^)k\{Z„^k)k + 0), W4 < 00 

for bj = (a;i,a;2,a;3,aj4) £ [0,oo]'^, where Z„ = {Zn,k)k is the random vector with the dis- 
tribution as specified by (2.1). 

Also, let /i:2,n: [0,oo)3 x [R"]^ x B{\EJ^+^Y) -^ [0,1] be the Markov kernel defined by 
the deterministic mapping (Ci, ^2,^3, 2^1,^2) H> {x,y), where we recursively set a;(0) =0 
and y(0) = ^1, and for 1 < fc < n, 

x{k) = x{k - 1) + yi/2(fc - l)zi(fc), y{k) - 6 + y{k - l){i^ + Z2{k)). 

For n > 5, let Fn = ([0,cx)]^,S([0,oo]^), ('C^"(G„,/i„))eg[o^oo)-i)- By construction, we have 
6{£h,n,J'n) = 0, by means of (A. 2). For all n > 5, observe that 

S{J'n,£h.n)< sup \\Ce{Gn,hn)- K2,nKi^nC'g"{Gn,hn)\\ 
8e[0,oo)3 

< |1 ~qi,n -92,n| + |1- (1-P«)" -92,n| 

and thus Sh.n is strongly asymptotically equivalent to J>j as n — > 00, by means of (A. 2) 
and (2.2). By (A. 4), J^„ converges strongly to the experiment T in the assertion of 
Proposition 2.1, completing the proof of (i). 

(ii) This follows from the same arguments as in (i). 
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7.3. Proof of Proposition 2.3 

(i) Define X,X„:[0,oo]'^ -^ [0,oo]'' as follows. If a; = (wi, . . . , W4) e [0,oo)3 x {00} 
such that wiws = W2, then set X^uj) = (wi, 00,0x3, 00); otherwise, set X{ui) = uj. If 
w = (wi,...,u;4) e [0,C!o)^ X {00} such that LOin{l — e"'^^/") = ^2, then set X„(w) = 
(wi, 00, cxj, 00); otherwise, set Xn{uj) ~ lo^ n G N. 

By definition, the deficiency is non-decreasing in the parameter set with respect to set 
inclusions. Further, we have Q-^ = Qg and Qg" = Qg.n for all n € N and thus, by (A. 2), 
6{T{e),T{e)) < S{f,T)^Oand 5(^(6), J'„(e)) < 6{T,Tn) = for aU n e N, complet- 
ing the proof of (i). 

(ii) First, assume that Q satisfies (2.24) for all x > 0. Without loss of generality, we may 
assume that Q C [0,oo)'^ is a finite set (see [27], Theorem 51.4). Define fie to be the set of 
aU UJ = (cJi,W2,W3,a;4) G (0,oo) x {00}^ x {0,oo} such that (aji,/3,a. A) £ (6n8e)\6e for 
some (/3,a, A) G [0,oo)'^. li uj ~ {uti, 0)2,(^3,004) G ^ie, then it follows from (2.24) that the 
corresponding pair (/3,a) = (/3(aJi), a{uJi)) S [0,oo)^ is uniquely determined by uji. Hence, 
we may define a mapping Y : [0,oo]^ -^ [0, 00]^ as follows: if w = {uji,uj2,u!3,u!4) S Oe, then 
we set Y{llj) = (a;i,/3(aji),a(a;i), ^4); otherwise, if w G [O,oo]^\rie, then we set Y(uj) =oj. 
As Q, and thus Hq, is a finite set, the mapping Y is i3([0,oo]^)/K([0,oo]^)-measurable. 
In view of (2.24), note that Qj = Qg for all 6* G 6 and thus (5(J'(e),-F(e)) = 0, by (A.2). 

Second, assume that (2.24) is violated. There then exist some ho > 0,6i — (/io,/3i,a2, Ai) 
e 9n6ene^ and 6*2 = (/io,;52,a2, A2) G 6 n 6e such that (/?i,ai) ^ (/32,a2)- 

Consider 0o = {di,92} and the decision space D = {(/?i,q;i), (/32,Q;2)}, endowed with 
the discrete topology. For 9 = {ho, 13, a, X) G 8, consider (continuous and bounded) 
loss functions Wg : D -^ M, where for x ~ {xi,. . . ,X4) G [0,oo]'*, we set Wg{x) = 1 — 
^{(I3,a)}{x2,x3)- Further, we define a Markov kernel p:[0,oo]^ x B{D) -^ [0,1], where 
for X G [0, oo]"^ and B G B{D), we set 

'^ \e(/32,a2)(^)> otherwise. 



We then have J Wg.{x)p{Ld,dx)Qg-{duj) = for i = 1,2. On the other hand, any 
Markov kernel p:[0,oo]'^ x B{D) -^ [0, 1] is of the form p(w,B) =p(a;)e(^,,„,)(S) + (1 - 
p(w))e(^2^„2)(^)' where p: [0,oo]'' — > [0, 1] is Borel, uj G [0,oo]^ and B G B[D). It is easy to 
see that for such a Markov kernel p, there exists a Markov kernel p : [0, 00]^ x B{D) — >■ [0, 1] 
such that both 



/■ 



Wg^{x)p{ijj,dx)Qg^{dijj) >e '''(1 -p(/io,oo,cx),cx))) 

and 

Wgj {x)p{uj, dx)Qg2 (do;) > e~'^p{ho, 00, 00, 00). 



In view of (A.l), we thus have S{T{Q),T{e)) > 5(J'(eo), ^"(60)) > c-''/2, which com- 
pletes the proof of (ii). 

(iii) This follows by the same arguments as in (ii). 
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Appendix 

Here, we collect necessary facts regarding Le Cam's distance in deficiency. The reader 
is referred to Le Cam [18], Le Cam and Young [19] and Strasser's monograpli [27] for 
unexplained notation encountered in this section. Let be a non-empty set, {E,A) be 
a measurable space and (-Pe)eee be a family of probability measures on A. The triplet 
£ = {E,A,{Pe)9i£0) is then called a (statistical) experiment. Consider two experiments 
£i = {Ei,Ai, {Pi^g)g^e), i = 1,2, indexed by 8. A decision problem is a triple {Q,D,W), 
where D is a topological space and W = {Wg)g^s is a loss function We : 13 — > M, 6* e 9. Let 
II W^ejloo = supj;g£) |W6i((i)|. Also, let e > 0. Then, £i is called e-deficient with respect to £2, 
notated as £1 'D^ £2, if and only if for all decision problems (9, D, W) with W continuous 
and bounded, and all /32 € 13{£2, D), there exists some /3i € B{£i,D) such that 

(3iiWe,Pi^0)</32{Wg,P2^9)+4W9\\oo, OeO, 

where B{£i,D) [i = 1,2) is the space of generalized decision functions (see [27], Defini- 
tion 42.2). The deficiency of £1 with respect to £2 is the number 

5{£i,£2) = n-vi{e>Q:£i^,£2}. (A.l) 

The relation £1 D^ £2 is interpreted in the following sense: we have £1 ^e £2 if £1 is more 
informative than £2 uniformly over all decision problems with continuous and bounded 
loss functions up to some error e. Two experiments £1 and £2 are called equivalent in 
deficiency if and only if £1 'Dq £2 and £2 3o £i- 
Recall that (see [27], Lemma 55.4 and Remark 55.6(2)) 

5{£i,£2) =irvisM\P2,e - KPi.eW (A.2) 

with infimum now taken over all Markov kernels K : Ei x £2 ^ [Q,l\. 

Le Cam's distance between £1 and £2 is a pseudo-metric on the space of all experiments 
indexed by 9 (see [27], Corollary 59.6), defined by setting 

A(fi,£2)=max{J(fi,f2),<5(f2,fi)}. (A.3) 

If {Ei,Ai) = {E2,A2), then we have (see [27], Corollary 59.6) 

A(£i,£-2)<sup||Pi.e-P2.e||. (A.4) 

Clearly, if £1 and £2 are two experiments indexed by the same 9, then £1 is equivalent 
to £2 in deficiency if and only if /S.(£i,£2) = 0. Let £, £"„, J-"„, n G N, be experiments, all 
indexed by 9. We then say that £n converges (strongly) in deficiency, or that £n and Fn 
are (strongly) asymptotically equivalent in deficiency, if and only if A(£„,£) — )■ and 
A(£'„, J>i) — !• 0; respectively, as n — >■ 00. 

For 7^ 9o ^ 9, we employ the notation £(9o) = {E,A^ {Pe)e^Ba) fo^' corresponding 
subexpcriments oi £ = [E,A, {Pe)e<£e)- We refer to weak convergence and weak asymp- 
totic equivalence in deficiency if and only if, for all non-empty and finite 9o ^ 9, the 
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corresponding subexperiments converge strongly and are strongly asymptotically equiv- 
alent in deficiency, respectively. 
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